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1  Introduction. 

Field  programmable  gate  arrays  (FPGA’s)  are  very 
useful  in  rapid  prototyping  as  well  as  small  volume  pro¬ 
duction  [3],  A  look-up  table  (LUT)  type  FPGA  shown 
in  Fig.  1.1  consists  of  LUT's  and  programmable  in¬ 
terconnection.  Both  LUT’s  and  programmable  inter¬ 
connections  are  controlled  by  static  RAMs.  We  as¬ 
sume  that  each  FPGA  has  q  LUT’s,  and  each  LUT 
can  realize  an  arbitrary  logic  function  of  k  binary  vari¬ 
ables.  We  also  assume  that  interconnection  resources 
are  sufficient,  i.e..  any  logical  network  with  q  LUT’s 
can  be  realized.  Note  that  wiring  is  implemented  by 
pass  transistors,  and  the  delay  time  for  interconnec¬ 
tion  will  often  be  larger  than  for  the  LUT’s.  Consider 
the  realization  of  a  moderately  complex  function.  If 
k  is  small,  e.g.,  k  —  2  or  3.  the  LUT’s  are  efficiently 
used,  but  the  interconnections  will  be  complex.  This 
will  be  especially  inefficient  since  interconnections  are 
often  more  expensive  than  logic.  If  k  is  large,  e.g., 
k  —  7  or  8,  the  interconnections  will  be  simpler,  but 
the  L  UT’s  are  not  efficiently  used.  Thus,  there  ex¬ 
ists  an  optimum  value  for  k  between  3  and  7.  One 
stud}'  [20]  shows  that  when  k  —  3  or  4,  FPGA’s  re¬ 
quire  the  smallest  chip  area.  However,  these  results 
have  not  been  used  by  manufacturers.  For  example, 
for  the  XILIXX  4000  Series  FPGA’s,  k  —  5,  and  for 
the  AT&T  ORCA  FPGA’s,  k  —  6  [1].  Various  design 
methods  for  LUT-based  FPGA’s  are  known: 

1)  Technology  mapping  from  AND- OR  multi-level 
logic  circuits  [3]. 

2)  Technology  mapping  from  binary  decision  dia¬ 
grams  (BDD’s)  [2]. 

3)  Functional  decomposition  [27,  12,  32]. 

Commercially  available  FPGA’s  contain  many  LUT’s, 
but  the  interconnections  are  much  slower  than  other 
standard  mask  type  gate-arrays.  Thus,  it  is  often  more 
important  to  reduce  the  propagation  delay  than  to  re¬ 
duce  the  number  of  LUT’s  [5,  1G] .  To  reduce  the  prop¬ 
agation  delay,  we  have  to  reduce  the  number  of  the 
levels  in  the  networks.  Also,  if  the  interconnections 
and  layout  are  regular,  then  the  propagation  delay  will 
be  smaller. 

In  this  paper,  we  propose  a  design  method  for  LUT- 
based  FPGA’s  that  produces  LUT  networks  having  the 


Figure  1.1:  LUT  type  FPGA 
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Figure  1.2:  LUT  network 

structure  shown  in  Fig.  1.2.  We  assume  that  each 
LUT  has  6  binary  inputs.  By  pairing  variables  of  two¬ 
valued  input  functions,  we  can  obtain  four- valued  input 
functions.  By  using  four-valued  logic,  we  can  reduce 
the  number  of  variables  to  consider,  and  can  design  a 
network  having  a  regular  structure.  We  use  the  pseudo- 
Kronecker  expansion  for  four- valued  input  two-valuecl 
output  functions  to  design  compact  and  regular  LUT 
networks.  The  proposed  method  is  also  promising  for 
the  four- valued  LUT  based  FPGA’s  [26,  14,  31]. 

2  Realization  of  Two- Valued  Input 
Functions  With  3-input  LUT’s 

Before  studying  the  realization  of  logic  functions 
with  6-input  LUT’s,  it  is  convenient  to  review  the 
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Figure  2.1:  Logic  circuits  corresponding  to  three  types 
of  expansions 

realization  for  two-valued  input  functions  by  3-input 
LUT’s.  Consider  three  expansions: 


/  =  .i'l/o  e  .('l/i,  (2.1) 

/  =  /o  ®  2:1/2,  and  (2.2) 

/  -  /l  9  .f'i/2-  where  /2  -  /0  ©  /i-  (2.3) 

(2.1),  (2.2),  and  (2.3)  are  called  the  Shannon  expan¬ 
sion,  the  positive  Davio  expansion,  and  the  negative 
Davio  expansion,  respectively.  Fig.  2.1  shows  the  logic- 
circuits  realizing  these  three  expansions;  they  are  de¬ 
noted  by  S,  pD,  and  11D,  respectively.  Note  that  a 
3-input  LUT  directly  realizes  these  circuits. 

When  the  Shannon  expansions  are  used  to  expand 
/o  and  /1 ,  we  have 

/o  =  a:2/oo  ©  2:2/01,  and 

fi  -  X2/10  ©2'2/n.  respectively. 

In  the  similar  way.  we  can  expand  the  new  sub- 
functions  as  follows: 


Figure  2.2:  Representation  of  logic  functions  using  the 
Shannon  expansion 
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Figure  2.3:  Realization  of  a  logic  function  using  multi¬ 
plexers 


/oo  -  .13/000  ©  2-3/001- 
/01  -  1:3/010  ©  .1-3/011- 
/10  =  I3/100  ©  .1-3/101-  and 
/11  =  I3/110  ©  .1-3/111- 

Fig.  2.2  is  the  binary  decision  tree  corresponding  to 
the  above  expansion.  If  we  replace  each  node  by  a 
two- input  multiplexer  (Fig.  2.1(a)),  we  have  the  net¬ 
work  shown  in  Fig.  2.3.  To  reduce  the  number  of  the 
multiplexers,  we  use  the  following  rules: 

1)  If  a  sub- function  is  a  constant  0  or  1,  terminate 
that  branch. 

2)  If  two  sub- functions  are  the  same,  i.e.,  if  /o  =  f\, 
then  extend  the  branch  down  to  the  next  level  (i.e., 
do  not  use  a  multiplexer). 

3)  If  the  current  sub- function  (/,;)  is  the  same  as  one 
already  generated  (/,-),  move  the  current  branch 
(for  fi )  over  to  the  other  branch  ( fj).  For  example, 
if  /01  is  the  same  as  /iq.  move  the  /iq  branch  over, 
thus  merging  the  two  nodes. 

A  decision  diagram  simplified  by  the  above  rule  is 
called  a  (reduced  ordered)  binary  decision  diagram 
(BDD).  For  a  given  function  and  a  given  order  of  the 
input  variable?,  the  BDD  is  unique. 


If  we  use  the  positive  Davio  expansion  instead  of  the 
Shannon  expansion,  then  we  have  (2.2).  When  we  use 
the  same  expansion  for  /q  and  /2,  we  have 

/o  =  /oo  0  .1-2/02-  and 
h  —  /20  0  .1-2/22- 

Fig.  2.4  is  the  tree  corresponding  to  this  expansion. 
Also,  in  this  case,  we  can  reduce  the  diagram.  Such 
a  diagram  is  called  the  functional  decision  diagram 
(FDD)  [11].  For  a  given  function  and  a  given  order 
of  the  input  variables,  the  FDD  is  unique.  In  a  simi¬ 
lar-  way,  we  can  consider  the  expansion  using  only  the 
negative  Davio  expansion.  Also,  we  can  consider  an 
expansion  using  either  the  positive  or  negative  Davio 
expansions  for  each  variable  [13]. 

Next,  consider  the  expansion  where  any  one  of  the 
three  expansions  is  permitted  for  each  variable  .  For 
example,  in  Fig.  2.5,  the  Shannon  expansion  is  used 
for  aq,  the  positive  Davio  expansion  is  used  for  a:2. 
and  the  negative  Davio  expansion  is  used  for  aq .  Such 
an  expansion  is  called  a  Kronecker  expansion  [6,  28]. 
The  reduced  diagram  is  the  Kronecker  decision  dia¬ 
gram  (KDD)  [18].  For  a  given  function  and  a  given 
order  of  the  input  variables,  there  are  at  most  3"  dif¬ 
ferent  KDD’s.  A  minimum  KDD  is  one  with  the  fewest 
nodes. 


Figure  2.4:  Representation  of  a  logic  function  using  the 
positive  Davio  expansion 


Figure  2.6:  Representation  of  a  logic  function  using  the 
Pseudo-Kronecker  expansion 


Figure  2.5:  Representation  of  a  logic  function  using  the 
Kronecker  expansion 
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Figure  2.7:  Relation  among  BDD’s.  FDD's,  KDD’s  and 
PKDD’s 

where  a =  1,  2,  ..,n)  takes  on  two  values. 

Definition  3.1  Let  Q  —  {0. 1,2,3}  and  B  —  {0,1}. 

/:  Qr  —a  B  is  a  four-valued  input  two- valued  out¬ 
put  function. 


Furthermore,  we  can  consider  the  following  expan¬ 
sion:  Any  one  of  the  three  expansions  is  permitted  for 
any  node.  For  example,  in  Fig.  2.6,  the  Shannon  ex¬ 
pansion  is  used  for  aq ,  the  positive  and  negative  Davio 
expansions  are  used  for  x->,  and  all  the  three  expan¬ 
sions  are  used  for  xg .  Such  an  expansion  is  called  a 
pseudo- Kronecker  expansion  [24,  28],  and  the  reduced 
diagram  is  called  a  pseudo- Kronecker  decision  diagram 
(PKDD).  For  a  given  function  /  and  the  given  order 
of  the  input  variables,  the  PKDD  with  the  minimum 
number  of  nodes  (LUT's)  is  the  minimum  PKDD  for 
/.  There  are  at  most  3U'  different  PKDD’s. 

The  relation  among  BDD,  FDD.  KDD  and  PKDD 
is  shown  in  Fig.  2.7.  This  shows  that  KDD’s  are  a 
special  case  of  PKDD's.  and  that  BDD’s  and  FDD’s 
are  each  a  special  case  of  KDD’s.  It  follows  that  when 
we  realize  a  given  function  by  3-input  LUT’s.  a  method 
based  on  a  PKDD  requires  the  fewest  LUT’s.  Fig.  2.8 
is  a  realization  of  the  function  in  Table  2.1  by  3-input 
LUT’s.  In  this  case,  we  used  only  the  Shannon  expan¬ 
sion  for  each  LUT.  However,  in  general,  we  can  use 
any  one  of  the  three  expansions  to  reduce  the  number 
of  LUT’s. 

3  Definitions  and  Basic  Properties 

We  assume  that  functions  to  be  realized  have  n  —  2 r 
inputs.  Let  X  —  (aq,  igg,  ■  ■  ■ ,  x„  )  be  the  input  variables, 


An  arbitrary  logic  function  of  n  variables  f(X)  can 
be  converted  into  a  four-valued  input  two- valued  out¬ 
put  function  as  follows:  /(AT,  A2,  •  ■  ■ ,  A',.)  =  /(A), 
where  A,-  =  (®2*-lvd2»)  takes  either  0,1,2,  or  3  iff 
(a’2i— l- x-2i)  —  (0,0).  (0,1).  (1,0),  or  (1,1),  respectively. 

Example  3.1  The  logic  function  shown  in  Table  2.1 
can  be  converted  into  the  four-valued  input  two-valued 
output  function  in  Table  2.2.  where  AT  =  (aq,aq)  and 
AT  =  (aq.aq). 

From  now  on,  a  four-valued  input  two-valued  out¬ 
put  function  will  be  simply  called  a  function,  and  an 
ordinary  two- valued  input  function  will  be  called  a  two- 
valued  input  function. 


Definition  3.2  Let  S  C  Q.  Xs  is  a  literal  of  X. 

where 

yS  _  f  0  if  X  iS 

A  -\1  If  x  e  s. 

When  S  contains  only  one  element.  A't*l  is  de¬ 
noted  by  X‘.  A  product  of  literals  A'f’Aif2  ■  ■  ■  A/f " 
is  a  product  term  that  is  the  AND  of  the  literals 
Xf1 .  X.f- ,  ■  ■  ■ ,  and  X%n .  A  sum  of  products 
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Figure  2.8:  Realization  of  the  function  in  Table  2.1 


is  a  sum-of-products  expression  (SOP),  where 
V(Sj.5’,.....5„)  denotes  the  inclusive- OR  of  product 
terms.  If  the  inclusive- OR  is  replaced,  by  exclusive- OR, 
the  res  ult  is  an  exclusive- OR  of  products 

£e  AfA'f 

(S!,S2,-,Sn) 

which  is  called  an  exclusive- OR  sum-of-products 
expression  (ESOP). 

Lemma  3.1  An  arbitrary  r-variable  function  can  be 
expanded  as 

/(A'i -  A'2 ,  *  ■  ■  -  Ar)  -  A? /( 0.  A'2 ■  ■  •  - ,  Xr ) 

VA'i  /( I-  A'a  ■  ■  ■  ■ ,  Xr)  V  X\  /( 2.  X2 ,  ■  ■  ■ ,  Ar) 
VX?/(3,X2,---Xr).  (3.1) 

This  is  the  Shannon  expansion  with  respect  to  A  . 


Lemma  3.2  /  is  uniquely  represented  as 

f-  V  f(a i .  a2 .  •  ■  • , an )Xf  Xf  ■  •  ■  A“- , 

(01,02  ,-":an) 

where  V(  )  is  the  inclusive- OR  for  all  the  com¬ 

binations  such  that  a ,  €  Q,  and  f(a\,  a2,  ■  ■  ■  *®n)  —  0 
or  —1.  f  can  be  also  uniquely  represented  by  an  ex¬ 
pression: 

f  =  £  /(«i •  ■  ■ ,  an )x? x?  ■  •  ■  a;;  , 

(  01,02  ,  "  ’ ,  0 1(  ) 

where  ^0(ui  a . a  )  represents  the  exclusive- OR  for 

all  the  combinations  of  a:  €  Q. 

4  Design  of  FPGA’s  Using  the  Shan¬ 
non  Expansion 

First,  we  will  show  a  naive  method  to  design  LUT 
networks,  where  each  LUT  has  six  inputs.  We  can 
realize  a  function  /  by  using  the  Shannon  expansion, 

/■:A  -A  j.A:;.  X4 )  =  A  " f,  V  A  /  V  A  -  V  A /■• .  (4.1) 

Fig.  4.1  shows  a  realization  of  the  expansion  in  (4.1). 
It  consists  of  one  LUT  that  drives  the  output  (called 
the  output  LUT)  and  four  other  LUT’s.  each  realizing 
/,;.  It  is  convenient  to  view  the  two  binary  control 
inputs  as  a  single  four-valued  input.  In  this  example, 
the  control  input  of  the  output  LUT  is  driven  by  A  | . 
while  X|  drives  the  control  inputs  of  the  other  four 
LUT’s.  Fig.  4.1  shows  that  some  combination  of  X3 
and  A'4  drives  the  primary  inputs  of  these  four  LUT’s. 

The  following  observations  will  reduce  the  number 
of  LUT’s: 

1.  If  /,;(*  =  0, 1,2,  or  3)  is  a  constant  function,  then 
the  corresponding  LUT’s  are  not  needed.  That  is, 
no  special  hardware  is  necessary  for  the  constants. 

2.  If  /,:  =  fj(i  ^  j),  then  one  LUT  can  be  used  for 
both  /,;  and  /,. 

The  diagram  corresponding  to  the  LUT  network  is 
called  a  QDD  (Quarternary  Decision  Diagram).  We 
can  design  an  LUT  network  using  an  algorithm  that 
is  similar  to  that  used  for  Binary  Decision  Diagrams 
(BDD’s):  If  two  sub-functions  are  the  same,  realize 
only  one  sub-function.  If  a  sub-function  involves  6  or 
fewer  variables,  stop  (it  is  realized  by  a  single  LUT). 

Example  4.1  Let  f  be  the  following  function: 

f  ( Xi ,  X2 ,  A;; ,  X4 )  =  A?  ( xf  0  xf )  V  Xl  ( x$  e  xj ) 
VX^A'i  0Xf)  VXfX'l, 
where  0  is  the  modulo  2  sum. 

If  f  is  represented  in  the  form.  ()■!),  we  have 

/o  =  A"|  ©Xf,  /i  =  Xf  ©Xf,  f-2  —  Xf  ©Xf,  f)  —  Xf. 

In  this  case,  all  the  sub-fimctions  are  non- constant  and 
distinct.  Fig.  ).l  shows  a  realization  for  f. 
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Figure  4.1:  Realization  of  a  logic  function  by  the  Shan¬ 
non  expansion 


Figure  4.2;  Network  structure  for  k  —  6 

5  Enumeration  of  LUT’s  to  Realize 
Given  Functions. 

Here,  we  consider  the  question  of  how  many  LUT's 
are  required  in  the  realization  of  a  given  function. 

Theorem  5.1  Let  /  be  an  arbitrary  r-variable  four- 
valued,  input  two-valued  output  function.  When  r  <  3, 
/  can  be  realized  by  a  single  LUT  with  six  two-valued 
inputs.  Wien  r  >  4,  f  can  be  realized  with  at  most 

LUT's. 

(Proof)  Because  an  LUT  realizes  any  function  with  3 
inputs,  a  single  LUT  is  sufficient.  Any  r-variable  func¬ 
tion,  where  r  >  3  can  be  realized  by  an  LL  T  tree 
structure  of  the  type  shown  in  Fig.  4.2.  where  one 
four- valued  variable  is  applied  at  each  level,  except  the 


leftmost  level,  where  three  variables  are  applied.  The 
number  of  L  L  i  t  required  is  at  most 

41'-2  _  1 

1  +  41  +  42  +  •  •  •  +  4r~:1  =  - - . 

o 

Q.E.D. 

The  above  theorem  gives  an  upper  bound.  However, 
in  many  cases,  given  functions  can  be  realized  with 
fewer  LUT’s. 


Theorem  5.2  Given  a  function  /(A'i,  X‘<-  •  ■  • ,  A”,.),. 
let  p  be  the  number  of  distinct  functions  of  the  form 
f  (cq.  a-).  ,  c  r  ; ; .  A; — 2 ■)  A f — l .  A;-),  where  a £  () ■  %  — 

1,2,  ■  ■  •  ,  r  —  3.  Then,  f  can  be  realized  with  at  most 

T  :1  -  1 

- - - -p  LUT’s. 

o 

(Proof)  /  can  be  expanded  as 

f{Xi,X 2,---,-Yr)  = 

V  A"”1  AAJ  ■  •  ■  X°I_z3  f(ai,a,2,  ■  ■  *> a,._3 ,  A',—2 ,  A',—i ,  A ,. ), 
where  £  Q,  i  —  1, 2,  ■  ■  ■  ,f  —  3. 


We  can  realize  a  selector  circuit  that  selects  one 
out  of  4r~:1  inputs  by  using  4'  (w1  LUT’s.  Be¬ 
cause  there  are  p  distinct  functions  of  the  form 
f(a i ,  a 2 .  ■  ■  ■ ,  «.r_3 .  Xr-2 ■  -Ar_i ,  A',.),  we  need  to  realize 
only  p  such  functions.  Q.E.D. 

6  A  Design  Method  Using  the  Pseudo- 
Kronecker  Expansion. 

The  design  method  using  the  Shannon  expansion  is 
simple,  but  requires  many  LL IT’s.  This  is  because  each 
LUT  is  only  used  as  a  multiplexer,  while  it  can  realize 
any  of  264  different  functions.  To  reduce  the  number 
of  LLTT’s,  we  can  use  the  theory  of  pseudo- Kronecker 
expansions,  which  was  developed  for  the  optimization 
of  ESOPs  with  four- valued  inputs  [23,  24]. 

Definition  6.1  Let  S  be  a  set  of  logic  values,  where 
S  C  Q.  The  characteristic  vector  of  S  is  a  — 

(q'O,  Q'i,  Q'2,  ci'3),  where  ctj  is  a  logic  value  such  that 


Qti  — 


f  0  if  i  i  s 

\  1  if  ieS. 


Definition  6.2  .4  matrix  M 


M 


Ofl 

«i 

ai 

«:i 


of  characteristic  vectors  a,;  is  non-singular  if  there 
exists  a  unique  matrix  M_1  such  that 


MM-1 


1000 

0100 

0010 

0001 


=  /, 


where  multiplication  is  AND  and  addition  is  exclusive- 
OR. 


If  one  expands  a  given  function  /:  Qr  — i  B  about 
variable  A'  using  the  Shannon  decomposition, 

7  =  A°/o  0  A'1/!  0  A-/.  0  A:;/:|. 

then  /o,  /i,  f-2 ,  and  f3  are  unique.  The  next  theorem 
[23]  shows  that  for  certain  other  forms  of  A  * .  the  sub- 
functions  are  also  unique.  The  same  theorem  can  be 
also  found  in  [19]  and  [24]. 

Theorem  6.1  (Expansion  Theorem)  An  arbitrary 
function  /  :  Qr  — »  it  caw  7e  expanded  with  respect 
to  a  variable  X  as 

f  =  XSo  h0  0  ASl  hi  ©  As!  h-2  0  XSs  h3  -  (6.1) 

where  ho,  hi,  h-2,  and  h3  are  unique  if  and  only  if  M 
is  non- singular ,  where 

.  and  <m  (i  =  0, 1,2,3)  are 
the  characteristic  vectors  of  Si-  ' 


M  - 


«o 

ai 

a-i 

a.) 


(Proof)  By  (6.1),  we  have 


Example  6.1  Consider  the  four-valued  input  two¬ 
valued  output  function  f  in  Table  2.2.  By  the  Shannon 
expansion,  it  can  be  represented  as 


f  -  X W/{0}  0  X 0  a@>/{2}  0  A®3  7(3}- 


Let  us  consider  the  expansion  of  the  form  f  —  XAfjii  0 
XB  fBj  ©  X C  fc  0  X°fD,.  In  this  case,  the  sub- 
functions  obtained  by  EXORing  sub -functions  are  as 
follows: 


f  _  v  :|I:L  r  _  y  i11  -  r 

7(0}  —  A2  ■'  7(1]  —  A2 

f  _  wl1,3}  f  — 

7(2}  —  -%  ,  7(3}  —  A-2 

7{0.1j  —  A2  ’  7(0,2}  —-'■2  ’  7(0,3}  —  a2 


_  Y (0.1 .2.3} 


•  7(1,3} 


=  A<1’2> 


7(1,2}  -  - v2 

7(1.2, 3}  =  A2{2,3},  7(0.2, 3}  =0, 

/{ 0,1,3}  —  A2  5  / {0 ,1,2}  —  A2  5 


■  7(2,3} 


_  x(0,3} 


7(0, 1,2 ,3}  =  X-2 


(0,2} 


/  =  ASo/i0  0  XSlln  0  A®  //  2  0  A  %3. 


O11  the  other  hand,  by  the  Shannon  expansion  of  /,  we 
have 

7  =  A°7o  0  A1/!  0  A2 72  0  A 3 ./I;. 

Let  I  be  the  unit  4x4  matrix.  Since  the  two  expansions 
above  represent  the  same  function, 

[ho ,  7 1 ,  h-2  ■  7  3  ]  M  —  [7o ,  /1 ,  f-2 , 73  ]  7. 


(if)  When  M  is  noii-singular,  the  inverse  matrix  M-1 
exists,  and  the  function  can  be  uniquely  represented 
with 


[ho  ,hi,  h2 ,  hi  ]  =  [7b  ■  7i  7  f-2 , 73  ]  M  1 . 


(only  if)  When  M  is  singular,  [//„.//  .  h-2- h:\  cannot  be 
uniquely  represented.  Q.E.D. 


Corollary  6.1  Let  A,  B ,  C ,  D .  A! ,  B' ,  C  and  D'  be 
subsets  of  Q  —  {0, 1,2,3},  and  let  a,  b,  c,  d.  a1,  b' . 
c!  and  d' .  respectively .  be  the  characteristic  vectors  for 
them. 

ui 


Let  M  — 


b 

c  ' 


d 


R  —  [a7*,  b'  ,  c7*,  d  ]  ,  and  MR  —  I  (unit  matrix). 
Then,  an  arbitrary  fo  ur-valued  input  two-valued  output 
function  f  can  be  uniquely  represented  in  the  forms 


7  =  A{0}7(o}  0  A«/{i}  0  A<2>/{2}  0  A <3>/{3}  or 
=  A'-4 7.4'  0  XBfBt  0  X c  fc  0  XDfo>,  where 

7.4'  =  ^2  ®7{*> ■  7 b'  —  ^2  ®7{*}’  7c' 

*€A'  ie-B' 

=  X!  ®7{*}’  7b'  =  ^  07(i} ■ 

icC  i£D' 


The  number  of  products  needed  to  represent  the  above 
sub-functions  are  0  for  7(023 }  •  and  7  for  the  others. 
The  characteristic  vectors  for  the  four  sub-functions 
7(023}  -  7(1} :  7(2}  and  7(3}  are  (1011),  (0100).  (0010). 
and  (0001),  respectively.  Note  that  these  vectors  are 
linearly  independent  of  each  other.  Hence,  f  can  be 
uniquely  represented  by  these  four  sub- functions.  Be¬ 
cause 


"1000" 

"1000" 

R  = 

0100 

1010 

and  AI  —  R  1  — 

0100 

1010 

1001 

1001 

f  can  be  represented  as 

f  -  A  7.,  0  XB  fBi  0  Af  fa  0  Aj°  fD% 

-  A1{0} 7(0 ,2, 3}  0  xPf{i}  0  a1{0,2}7{2}  0  a1{0,3}7{3}, 

-  A1{1]A2{0’3}  0  a1{0’2]A2{1’3}  0  A1{0’3}A2{0’1}. 

Note  that  this  expansion  has  only  three  product  terms. 

( End  of  Example ) 

A  computer  enumeration  shows  that,  out  of  the 
(24)4  =  216  possible  4x4  matrices.  20160  are  11011- 
singular  [23].  If  we  ignore  the  labeling  of  logic  values  for 
one  of  the  variables,  there  are  essentially  2U4,0U  =  840 
different  expansions,  of  which  the  Shannon  expansion 
is  one.  This  leads  to  the  question  of  which  expansion 
reduces  the  complexity  of  the  network.  We  can  select 
an  expansion  that  reduces  the  complexity  of  the  net¬ 
work. 

Definition  6.3  Pseudo- Kronecker  decision  dia¬ 
grams  (PKDD’s)  of  n -variable  four-valued  input  two- 
valued  output  functions  are  defined  recursively  as  fol¬ 
lows: 


1)  A  PKDD  is  a  terminal  node  v  labeled  by  value (v), 
where  valuer)  G  {0,1}. 

2 )  A  PKDD  is  a  non-terminal  node  v  that  has  four 
children.  ho(v).  hifv),  /?2 ( t’ )->  and  hs(v),  that  are 
also  PKDD's,  where  v  is  labeled  by 

(a)  index(u),  where  index(v)  G  {1,2,- and 

by 

(b)  a  4  x  4  regular  matrix  M,  where  M  is  defined 
■in  (6.2). 

The  correspondence  between  a  PKDD  and  a  Boolean 
function  is  defined  as  follows: 

A  PKDD  G  having  a  root  node  v  denotes  a  function 
fv  defimed  recursively  as: 

3)  If  v  is  a  terminal  node: 

(a)  If  value)  v)  —  1.  then  f,  —  1. 

(b)  If  valuefv)  —  0.  then  fv  —  0. 

4)  If  v  is  a  nonterminal  node  with  index(v)  —  i.  then 
fv  is  a  function 

X?°fho(v)  ®  XtSlfhl(v)  ®  Xf2  fh 2(v)  ©  Xf3fhs(v), 

such  that 


M  - 


ao 

ai 

a, 

«3 


is  non-singular. 


where  a,  is  a  characteristic  vector  for  Sj  (  i  —  0, 1,2,3). 
A',:  is  called  the  decision  variable  for  node  it. 


1 )  Let  /  be  the  function.  Expand  it  in  the  form 
f  =  Xf°/i.0  ©  A flhx  ©  A  ®  h2  ©  A 
that  minimizes  the  cost  of  the  expansion,  where 


3 

COST(f  )  =  J2C0ST(hj)'  and 

3=0 


COST  (hi)  - 


'  0  if  hj  is  constant 
0  if  hj  is  already  realized 
k  if  hj  is  not  realized  previously 
and  depends  on  k  variables 


2)  Create  the  nodes  for  new  functions  if  they  do  not 
exist.  Record  that  they  are  realized. 

3 )  Expand  the  remaining  nodes  in  the  same  way. 

Example  6.2  Consider  the  )- valued  input  function  f 
in  Example  3.1. 

fiX  .Xa.X,.X:)  -  Af^Xf  ©  A|)  V  A-f^Af  ©  Xf) 
VXx{2}(X|  ©  Xf )  v  xf3}xf. 

This  expansion  has  four  non-zero  sub-functions.  How- 
everf  we  can  reduce  the  number  of  non-zero  sub- 
functions  by  considering  an  expansion  with  the  form 

f  -  A  7,  ©  A  "/a-  ©  A"  /,  ©  A"/,r. 

To  find  a  good  expansion,  we  calculate  the  costs  for  all 
the  15  sub -functions. 


Definition  6.4  A 

PKDD  such  that: 


reduced  ordered  PKDD 


1)  In  Definition  6.3,  for  any  non-terminal  node  v. 

if  hifv)  is  also  nonterminal,  then 

indetxf v )  < indexfh ;  (v ) ) . 

2 )  No  two  subgraphs  in  the  PKDD  are  identical. 

The  PKDD  for  f  is  said  to  be  minim  um  if  it  contains 
the  least  number  of  nodes. 

It  is  very  difficult  to  obtain  ail  exact  minimum 
PKDD  ,  so  we  will  consider  a  heuristic  method  which 
obtains  a  good  solution  quickly.  In  this  case,  we  have 
to  consider  the  following  15  sub-functions: 


/{«  © 

/{12} 

/ {012}  =  /{ 0}  ®  /{ 1}  ®  /{ 2©  /{ 013} 

~  / {0 }  ®  /{I }  ®  /{3}’ 

/{023f  =  /{0}  ®  f  {2}  ®  /{3}»  /{ 1 2 3 } 

~  /{l}  ®  /{2}  ®  /{3}’ 

/ {0 12 3 }  =  /{ 0}  ®  /{ 1}  ®  / {2 }  ®  / (3 }  • 

Algorithm  6.1  ( Generation  of  a  PKDD  :  Simple 
■method  ) 

0)  For  i  —  1  to  r  —  3  do  the  following: 


/{ 0}  =  / {1.2}  =  X)  ©  Xf 

,  cost  —  2. 

/{ 1}  -  /{o,2}  -  X)  ©  A.f 

,  cost  —  2. 

/ {2 }  =  /{0,1}  =  X-2  ©  -Yf 

.  cost  —  2. 

/{3}  *7{0, 1,2,1}  =-Y| 

.  cost  —  1. 

/ {0.3}  —  / {1.2.3}  =  -A)  ©  Af  ©  A.f 

,  cost  —  3. 

/  {1 ,3  }  =  /  {0 ,2,3}  =  -Xj 

,  cost  —  1. 

/{2 ,3  }  -  /{ —  Af 

,  cost  --  1. 

/  (0 ,1,2}  =  0 

,  cost  —  0. 

Note  that  fit,  fn,,  fc,  and  fn>  have  smaller  costs, 
where  A'  =  {2,3},  B'  =  {1,3},  C  =  {3},  and  D'  = 
{0. 1.  2}.  The  characteristic  vectors  for  A! ,  B' ,  C ,  and 
D'  are:  HUU.li.  (0.1. 0,1),  (0,0, 0,1),  and  (1,1, 1,0), 
and  they  are  linearly  independent.  Thus,  we  have  R 
and  M  as  follows: 


=  /{0}  ®  /{ 1}-  / {02}  =  /{«}  ®  /{2 }  ,  / {03} 

"0001" 

"1010" 

=  /{0|  ®  / {3  }  *  ....  R_ 

0101 

, 

1100 

—  f{  1}  ®  /{ 2}-  /{13}  =  /{ 1}  ®  / (3 }  ■?  /{23}  U  ~ 

1001 

and  M  =  R  — 

0111 

~  / {2 }  ®  /{3}* 

1110 

1000 

Because  A  =  {0.2},  B  —  {0,1},  C  —  {1,2,3},  and 
D  —  {0},  /  can  be  represented  as 

f  -  xffA,  ©  x?fB,  ©  x?fv  e  x°m 


_  x{0,2} 


r^/{2,3)  ®  4°’1}/{1,3}  ®  X^*ff{ 3} 


(1,2,3}  . 


®-4°7{0,l,2}, 

_  x{0,2}  a-(3] 


X®JJ  ©  x{0’1}Xo{3}  ©  X 


-{1,2,3}X(3} 


Note  that  the  last  expansion  contains  only  three  non¬ 
zero  subfunctions.  (End  of  Example) 

7  Experimental  Results  [29] 

7.1  For  3-input  LUT’s  (2- valued  case) 

Table  7.1  compares  the  number  of  non-terminal 
nodes  for  BDD's,  FDD's.  FDD’s,  and  PKDD's.  For 
FDD’s,  the  column  headed  with  ”pD”  denotes  the  case 
where  only  positive  Davio  expansions  are  used,  and 
’’both”  denotes  the  case  where  both  positive  and  nega¬ 
tive  Davio  expansions  are  used.  For  each  diagram,  we 
obtained  an  ordering  of  input  variables  that  reduce  the 
number  of  nodes.  The  orderings  of  the  input  variables 
are  obtained  by  a  simulated  annealing  method  similar 
to  [8].  Note  that  the  orderings  that  minimize  the  size 
of  BDD’s  do  not  always  minimize  the  size  of  FDD’s, 
KDD’s  or  PKDD’s. 

Let  size(BDD),  size(FDD),  size(KDD)  and 
size(PKDD)  be  the  numbers  of  non-terminal  nodes  for 
optimized  BDD,  FDD.  KDD  and  PKDD  for  a  func¬ 
tion,  respectively.  The  experimental  results  show  that 

size(PKDD)<size(KDD)  <sizc(BDD), 

size(KDD)  <size(FDD:both)  <size(FDD:pD), 
size(FDD:both)  <size(FDD:nD). 

These  results  are  consistent  with  the  relation  of  deci¬ 
sion  diagrams  in  Fig.  2.7.  The  bottom  row  of  the  Table 

7. 1  shows  the  relative  sizes  of  the  diagrams.  On  the  av¬ 
erage,  PKDD’s  require  29%  fewer  nodes  than  BDD’s. 
However,  the  nodes  for  BDD’s  and  FDD’s  are  compa¬ 
rable. 

7.2  For  6-input  LUT’s  (4- valued  case). 

The  bit  pairing  algorithm  [22]  for  PLA’s  with  de¬ 
coders  was  used  to  produce  four-valued  input  two- 
valuecl  output  functions.  Table  7.1  also  compares  the 
number  of  non-terminal  nodes  for  QDDs  (Quarternary 
Decision  Diagrams)  and  4- valued  PKDD’s.  QDDs  were 
obtained  by  the  4- valued  extension  of  Shannon  Expan¬ 
sion  in  (3.1).  Details  of  the  optimization  algorithm  is 
shown  in  Appendix.  On  the  average,  QDDs  require 
■37%:  fewer  nodes  than  BDD’s,  and  4-valued  PKDD’s 
require  51%  fewer  nodes  than  BDD's.  Also.  4- valued 
PKDD’s  require  23%  fewer  nodes  than  QDDs. 

8  Conclusions  and  Comments 

In  FPGA  design,  interconnections  are  often  more 
expensive  than  logic.  FPGA's  using  3-input  LUT’s  re¬ 
quire  many  logical  levels  and  complex  interconnections. 
On  the  other  hand,  FPGA’s  using  6- input  LUT’s  re¬ 
quire  fewer  interconnections  and  fewer  logical  levels. 

In  this  paper,  we  showed  a  method  to  represent 
logic  functions  by  using  pseudo-Kronecker  diagrams 
(PKDD’s).  Experimental  results  show  that  2-valued 
PKDD’s  require  29%  fewer  nodes  than  BDD’s,  and  4- 
valuecl  PKDD’s  require  23%:  fewer  than  QDDs,  the  4- 
valued  extension  of  BDD’s.  Thus,  this  method  is  useful 
for  the  design  of  FPGA’s  with  6-input  LUT’s.  How¬ 
ever,  when  LUT’s  have  less  than  6  inputs,  this  method 
is  not  applicable. 


In  the  practical  design  of  FPGA’s,  we  can  further 
reduce  the  number  of  LUT’s  as  follows: 

1)  Use  complement  edges  [15,  7]. 

Consider  the  decision  diagrams  where  each  edge 
can  complement  the  sub-function.  This  corre¬ 
sponds  the  fact  that  LUT’s  can  complement  the 
inputs  with  no  extra  cost.  In  the  case  of  2-valued 
BDD’s.  complement  edges  can  reduce  the  number 
of  nodes  by  30  to  50%  [15]. 

2 )  Remove  redundant  LUT’s. 

For  a  3- input  LUT.  if  both  of  two  sub- functions  are 
constants,  then  it  realizes  a  variable  or  its  comple¬ 
ment.  Thus,  this  LUT  can  be  removed,  and  the 
variable  is  used  as  the  output.  If  an  LUT  depends 
on  at  most  three  variables,  then  it  can  be  realized 
with  one  LL~T.  For  a  6-input  LUT.  if  it  depends 
on  at  most  six  variables,  then  it  can  be  realized 
with  one  LUT. 

3)  Merge  several  LUT's  into  one. 

For  6-input  LUT’s.  if  any  of  the  sub-functions 
is  a  constant,  then  the  function  is  realize  by  an 
LUT  with  5  or  fewer  inputs.  In  AT&T  ORCA 
series  FPGA’s,  an  LUT  can  be  configured  to  real¬ 
ize  either  a  6-input  function,  or  a  pair  of  5-input 
functions,  or  four  4-input  functions.  Thus,  LUT’s 
with  fewer  inputs  can  be  merged.  Except  for  the 
6-input  case,  some  of  the  inputs  must  be  shared 
among  the  functions. 

We  assumed  that  the  FPGA’s  are  all  binary,  but 
the  control  inputs  for  LUT’s  can  be  four- valued  inputs 
instead  of  pairs  of  binary  variables.  This  will  signif¬ 
icantly  reduce  the  connections.  Thus,  this  method  is 
also  useful  for  a  design  of  multiple-valued  LUT  type 
FPGA’s. 

[9]  shows  a  method  to  realize  a  logic  function  by 
using  multiple- valued  multiplexers,  where  only  circuits 
with  tree  type  structure  are  generated.  Also,  intercon¬ 
nections  are  more  complex,  since  each  level  may  have 
different  control  variables. 
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Appendix 

Algorithm  8.1  (4-valued  PKDD  Simplification ) 

1.  Simplify  QDDs  by  vermutinq  the  input  variables 
(Algorithm  8.2). 

2.  Simplify  4 -valued  KDD’s  by  changing  the  method 
of  expansion  for  each  variable  (Algorithm  8.3). 

3.  Simplify  4 -valued  PKDD’s  by  changing  the  ■ method 
of  expansion  for  each  node  (Algorithm  8-4). 

Algorithm  8.2  ( Optimization  of  input  ordering  for 
QDDs). 


Table  7.1:  Number  of  nodes  in  Decision  Diagrams 


function 

in 

out 

BDD 

FDD 

KDD 

PKDD 

4- valued 

PD 

both 

QDD 

PKDD 

9sym 

9 

1 

33 

27 

27 

27 

26 

17 

17 

add6 

12 

7 

47 

23 

23 

23 

23 

17 

12 

adr4 

8 

5 

29 

15 

15 

15 

15 

11 

8 

alu2 

10 

6 

61 

94 

77 

60 

59 

48 

38 

apla 

10 

12 

103 

188 

124 

103 

94 

78 

76 

bw 

5 

28 

108 

116 

106 

103 

90 

71 

56 

clip 

9 

5 

97 

87 

85 

85 

69 

52 

34 

co  14 

14 

1 

27 

26 

26 

26 

26 

13 

13 

coni 

7 

2 

15 

17 

17 

15 

14 

11 

11 

dc2 

8 

7 

64 

77 

71 

63 

54 

44 

37 

dist 

8 

5 

152 

167 

163 

152 

127 

75 

71 

dkl7 

10 

11 

63 

98 

71 

63 

62 

47 

45 

dk27 

8 

9 

25 

24 

23 

23 

22 

25 

25 

duke2 

22 

29 

373 

367 

351 

332 

269 

257 

221 

£51  in 

8 

8 

67 

38 

35 

35 

28 

46 

22 

inc 

7 

9 

70 

78 

75 

65 

58 

41 

37 

rnisexl 

8 

7 

36 

42 

39 

35 

31 

27 

24 

misj 

35 

14 

43 

46 

44 

43 

43 

26 

26 

mlp4 

8 

8 

141 

108 

108 

108 

99 

81 

57 

radd 

8 

5 

29 

15 

15 

15 

15 

11 

8 

rd53 

5 

3 

23 

13 

13 

13 

13 

11 

7 

rd73 

7 

3 

43 

21 

21 

21 

21 

21 

11 

rd84 

8 

4 

59 

29 

29 

29 

29 

30 

15 

rise 

8 

31 

67 

62 

59 

59 

56 

57 

51 

rot8 

8 

5 

75 

89 

83 

74 

60 

41 

36 

sao2 

10 

4 

85 

98 

97 

84 

69 

50 

36 

sex 

9 

14 

47 

46 

45 

41 

41 

38 

35 

sqr8 

8 

16 

233 

201 

201 

201 

169 

147 

109 

t481 

16 

1 

32 

23 

17 

17 

17 

15 

10 

tial 

14 

8 

690 

543 

494 

456 

381 

392 

212 

tslO 

22 

16 

146 

156 

156 

132 

132 

148 

148 

xor5 

5 

1 

9 

5 

5 

5 

5 

5 

3 

z5xpl 

7 

10 

68 

53 

41 

41 

33 

40 

24 

totcll 

3160 

2992 

2756 

2564 

2250 

1993 

1535 

ratio 

1.00 

0.95 

0.87 

0.81 

0.71 

0.63 

0.49 

1.  Let  X\,  X->,  •••,  Xr  be  the  original  ordering  for 
the  QDD. 

2.  For  i  —  1  to  r  —  1 
{  For  j  —  2  to  r 

exchange  A';  with  Xj.  and  count  the  number  of 
nodes  in  QDD. 

Adopt  the  ones  that  minimize  the  number  of 
nodes  } 

3.  Repeat  2.  while  the  n  umber  of  nodes  is  reduced. 

Algorithm  8.3  (Simplification  of  4-valued  KDD's). 

1.  Let  A i .  X-) .  ■  A,,  be  the  ordering  obtained  by 

Algorithm  8.2. 

2.  Construct  a  FDD  (Pentadecimal  Decision  Dia¬ 
gram )  that  has  15  children  for  each  non-terminal 
node,  where  each  child  corresponds  to  one  of  15 
sub -functions. 


3.  Let  QDD  be  the  initial  4-valued  KDD. 

4.  For  i  —  1  to  r  —  1 

Expand  the  sub-functions  with  respect  A.;  in  840 
different  ways,  arid  find  one  that  minimizes  the 
number  of  nodes. 

5.  Repeat  4 ■  while  the  number  of  nodes  is  reduced. 
Algorithm  8.4  (Simplification  of  4-valued  PKDD's). 

1.  Let  X\,  X2 ,  •  j X,  be  the  ordering.  Let  the  4~ 
valued  KDD  obtained  by  Algorithm  8.3  be  the  ini¬ 
tial  4-valued  PKDD. 

2.  For  i  —  1  to  r  1 

For  each  node  for  Xj  .  expand  the  sub-functions  in 
840  different  ways,  and  find  ones  that  minimizes 
the  number  of  nodes. 

3.  Repeat  2.  while  the  number  of  nodes  is  reduced. 
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